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1. Introduction and statement of results 



Abstract. We give upper and lower bounds on the number of graphs of fixed degree which have a positive density of 
triangles. In particular, we show that there are very few such graphs, when compared to the number of graphs without 
this restriction. We also show that in this case the triangles seem to cluster even at low density. 
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In a number of contexts involving large graphs (such as the Web or Citation Networks) it has 
been observed that such graphs contain a large number of triangles, and probably a positive 
density of them (per node). We refer to [EM] for a detailed discussion. The importance of 
topology is also mentioned in [AB], where the authors say (p.41): 

rH ! "But if the topology of these networks indeed deviates from a random graph, we need 

to develop tools and measurements to capture in quantitative terms the underlying 
organizing principles." 

On the other hand, it is well known that in several models of random graphs with a bounded 
number of links per node the probability of observing a large number of triangles is vanishingly 
small when the number of nodes diverges. A natural question is therefore to estimate more 
precisely the number of graphs with a large number of triangles. It will become clear from the 
discussion of our paper that this result is beyond the "large deviation bounds" which are found 
in the literature [VV, JLR]. 

In this paper we study the cardinality of sets of graphs with a positive density of triangles 
per node. We consider (random) graphs with sparse sets of links, i.e., random graphs in which 
the number of links is bounded by a fixed constant times the number of nodes. We will consider 
three models of labeled graphs: 

G) The model Q n k comprises the graphs with n nodes and kn links. We call them /c-general 
graphs. 

O) The model Q n k _ out is the set of all graphs with n nodes, and from each node there leave 

exactly k directed links (directed from that node). We call these graphs A;-out. 
R) The model Q n fc _ reg is the set of all graphs where at each node exactly k links meet. (This 
definition is only interesting if kn is even, which we tacitly assume in the sequel.) These 
graphs are called k -regular. 
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A well- studied question is that of the probability of finding triangles in such graphs, where 
the probability is relative to the uniform measure on the set of graphs, giving the same weight 
to each graph. For all of the above examples, it is known that the expected number of triangles 
in these graphs is bounded independently of n, by a quantity A = (k 3 ) . Furthermore, for each 
t > 1, it has been shown that the probability to find exactly t triangles is given, in the limit 
n — > oo, by the Poisson distribution 

Note however, that this limit is not at all uniform in t, as will be illustrated by our results in 
Sec. 4. Further studies have greatly refined this result, giving very precise estimates on the tails 
of this distribution, as a sort of large deviation result. Our study, in this paper, goes beyond that 
region, since we ask for the size of subsets of the three graph families with a positive density 
of triangles. We assume throughout that a is a fixed constant a > and we consider those 
graphs in the above classes which have an triangles (or, more precisely [an] triangles, where 
[x] denotes the integer part of x). We denote these subsets by Q n k a , Q^ k _ outjCt , £ n , fc _ reg , a - If 
X is a finite set we denote by \X\ its cardinality. Our main result is the following 

Theorem 1.1. Fix k E N and a > 0. For the three graph families we have the bounds (valid 
when the lower bound is non-negative): 

Hm 1 ° g|g ;> fc '" 1 = 1, (1.1) 
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n-oo l0g|^ n)fc _ reg | k(k-l) 



We conjecture that in the statement above the limits exist (assuming, of course, that kn is 
even in the A; -regular case). 

Remark 1.2: From the point of view of Information Theory or Statistical Mechanics/Large 
Deviations, the number of triangles is an extensive quantity relative to the number of nodes. But 
the logarithmic bounds we find are not extensive in the number of nodes: They are extensive and 
small on the scale (of the logarithms) of the number of graphs. This suggests that the presence 
of a positive density of triangles is a very strong information about the system. 

Indeed, imposing that the number of triangles is proportional to the number of nodes leads 
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intuitively to the conclusion that if one considers two links emanating from a common node, 
there is a non zero probability that their ends are also linked. 

Remark 1.3: We prove more precise bounds in (3.2). 

Remark 1.4: One shouldnote that a k -regular graph is more like a k/2-out graph (because each 
link is counted twice). 

Remark 1.5: The lower bounds are obtained by constructing graphs containing complete graphs 
of maximal size. We do not know whether these bounds are optimal. If they are, this would mean 
that complete graphs are "typical" among random graphs with a positive density of triangles. 

The reader should observe that the lower bound is a little surprising. Indeed, assume a > 
is very close to 0. Then, one might expect that since the density of triangles is very low, they 
will typically be (edge and node) disjoint in the set ^ n A ._ 0Ut a (and similarly for G n ^_^ a )- 
For each such triangle, once one has placed 2 of the 3 links forming it, the third link is already 
determined when we close the triangle and thus, under the assumption of disjointness, only 
nk — an links can be chosen freely [in the case of the regular graphs this number is nk/2 — an] 
leading to an upper bound of n nk ~ an . But, our lower bound is larger than that. Therefore we 
conclude: 

Remark 1.6: In the families of graphs Q n fc _ out a and Q n fc _ reg a the triangles have a natural 
tendency to coagulate into clusters. 

Another surprising result is that in the case of Q k n a of Eq.(l.l), there is no loss in the 
number of graphs on the scale of n n . This might seem all the more surprising in view of the 
Poisson distribution of the expected number of triangles (but is more credible in view of the 
non-uniform diagonal limit t = an — > oo which we are considering). 

The paper proceeds from the k -regular graphs via the k-out graphs to the /c-general graphs. 
We prove first upper bounds and then the (easier) lower bounds. 

2. Upper bound for /c-regular graphs 

This section should be considered as a warm-up for the next one. Therefore, many arguments 
are sketched, and the reader can find longer explanations in the next section. On the other 
hand, the general line of proof should be more transparent. The reader will also notice that the 
/c-regular case is much less delicate than the k-out case. 

We assume that nk is even because otherwise there are no k -regular graphs. For a k -regular 
graph, the general bound is [B]: 

\G n , k - Kg \ « C{k) n n nk ' 2 . (2.1) 



A given link cannot be an edge in more than k — 1 triangles, because exactly k links meet at 
each node (see also Lemma 3.3). For every link we say that it is s times occupied if it occurs in 
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s triangles, and every triangle occupies (in this sense) 3 links. Thus the total occupation number 
is 3cm, and therefore the number of links involved in edges of triangles is at least 3an/ (k — 1) 
(and at most 3an). We next bound the number of ways to draw an triangles. Label the nodes, 
and for every node i let t i be the number of triangles having i,j,m as corners with i the smallest 
of the three indices. The number of ways to choose the t i is 

an + n- 1\ < 2 ( Q+ i )n 



, , - - (2-2) 

71—1 

which is negligible on the scale we consider. With the t i fixed, we draw the triangles at 
i = 1,2,.... Note that in this process we will have to place at least £ = 3cm I {k — 1) links. 
Now, if we draw a triangle, several things can happen. Either the triangle is already drawn, 
because its 3 sides have been placed as sides of triangles which have been drawn earlier. No link 
needs to be placed in this case. In all other cases, between one and 3 links need to be drawn. 
The least favorable case occurs when 3 links have to be placed. Then, there are at most n(n — 1) 
possibilities to choose the first 2 links and then at most 1 possibility for the third, and we get a 
factor of n 1- ^/ 3 - 1 per link. If only one new link is used, and it starts at % (and the two others are 
already there), there are at most k 2 possible endpoints for that link and we get a factor k V" 1 in 
this case. If the one missing link is between two links (which are already there when this link has 
to be placed) we get at most k(k — l)n°/2 possibilities. Finally, if two links are missing, there 
are 2 possibilities: Either it is two links starting at i, and this makes at most nk = kn 2 ~ l ways, 
or one link starting at i and one not starting at i which makes at most nk = n 2 ~ l k possibilities. 
Indeed, there are n possibilities to choose the end j of the link starting at i and then there are 
at most k possibilities for choosing the second link. Once two links are chosen, the triangle is 
completely determined since all its nodes are fixed. 

Thus for all these links we get a bound of at most 

k 2i n e - 1 / 3 (2.3) 

possibilities, that is, k 2 n 2 ^ 3 per link. Finally, the remaining kn/2 — £ links can be put in 
at most n hn / 2 ~ e ways. Summing over the possible number of links (which is bounded by 
3an/ (k — 1) < £ < 3an and yields a factor which can be easily absorbed), and combining the 
two bounds, we get a bound 

3an/(3(fc — 1)) ^jn^kn/2—ctn/(k — l) (2 4) 

This completes the proof of the upper bound of (1 .3). □ 



Remark. A second proof could be derived from a modification of the proof for the case of /c-out 
graphs which we give below. 
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3. An upper bound for /c-out graphs 

In this section, we consider the set Q n k _ oul of graphs where each node has k out-links. The 
cardinality of this set is 

|£n,fc-outl = )t ^ ' ^'^ 

In other words, we allow for links which go back and forth between 2 nodes, but we do not 
allow double directed links in the same direction between 2 nodes. Also self-links (loops) are 
forbidden. We denote by Q n fc _ out a the subset of Q n k _ oul with [an] triangles, where triangles 
are counted as follows: Once the links are placed, their orientation is neglected and unoriented 
triangles are counted, including the multiplicity of the edges (which can be 1 or 2 by what we 
said above). For example, 3 nodes with the possible 6 directed links between them count as 
8 = 2 3 triangles. By and large, these distinctions are not very essential for the proofs we are 
going to give and other choices will work with similar proofs. 
Since, for fixed j, one has 



(m — j + \) J fm\ m? 



we see that, for fixed k, 



log|^ nfc _ out | 
lim -— = k , 

n^oo TllOgn 



which we will sometimes write in the more suggestive form 

\y n ,k-oat\ ~ 71 

To be more precise, we define the notation F(n) w n nc to mean that there are constants C 1 > 
and C 2 independent of n (but not of k) such that 

C™n nc < F{n) < C?n nc , (3.2) 

so that the error term in the limit is subexponential. Another way to say this is 

log|S n , fe _ out | =n(fclogn + 0(l)). 

Define 

Q(k) = ' 



2k(5k+l) ' 



Proposition 3.1. There is a C = C(a, k) < oo for which the quantity \Q n fc _ out a \ satisfies an 
upper bound of the form 

|Sn,-out,J < C"n»<*-°«<*» . (3.3) 



Remark 3.2: This is the upper bound of (1.2). 

To avoid the notation [an], we assume henceforth that an is an integer. We consider a 
configuration with an triangles. The triangles which can occur in a /c-out graph are of two 
types, which we call type R (for round) and type F for (for frustrated) depending on the relative 
orientation of the links. 
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Fig. 1: "Round" and "frustrated" triangles. In the first case all links "follow each other" while in the second there is 
a "reverse" (frustrated) link, the link c. The corner with a circle is called the anchor of the triangle, and the links are 
then labeled in such a way that for a round triangle the a link leaves the anchor, and the others follow in order, while for 
the frustrated triangles, the a link leaves the anchor and the b link leaves the end of the a link. These rules determine a 
unique labeling of each triangle if we require the anchor for the round triangle to be at the node with lowest number. 

We next consider the number of triangles in which a given edge can occur. Because of the 
k-oui model, edges of type b can occur in arbitrary many triangles of type F, by just connecting 
2 lines from any node to a given edge. In this respect, the /c-out model is more complicated than 
the fc-regular model. However, the other lines can occur only in a small number of triangles. 

Lemma 3.3. Bounds on the number triangles per link: 

- A link can be an edge of type a in at most k triangles of type R and in at most k — 1 triangles 
of type F. 

- A link can be an edge of type b in at most k triangles of type R. 

- A link can be an edge of type c in at most k triangles of type R and in at most k — 1 triangles 
of type F. 

Proof. Consider first the case R. The edge a can occur in at most k triangles. To see this, note 
that once a is placed, there are k edges of type b leaving its end, and then the triangle must be 
closed, so there are at most k such triangles. Since R is round, the same reasoning can be done 
for the other edges. In the case of a triangle of type F, we have already remarked that there 
is no bound possible for the link b, but we claim the others cannot be part of more than k — 1 
triangles of type F. Indeed, once link a is fixed, we need to choose another out-link to become 
link c (and then the b link is fixed). This gives k — 1 as a bound. Finally, link c can belong only 
to k — 1 triangles of type F because for fixed c there remain only k — 1 candidates for the edge 
of type a. □ 

An important consequence of Lemma 3.3 is that the number of edges belonging to at least 
one of the an triangles grows proportionally with an: 



Lemma 3.4. The number of edges £ tTiang belonging to at least one of the an triangles in a graph 
of type £ njfc _ out is bounded by 




< 3cm . 



(3.4) 



Proof. The upper bound is obvious. To prove the lower bound, note that every triangle involves 
a link of type a. Since there are k links leaving from the far end of that link, the number of 
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triangles for which this link is an a link is bounded above by 2k (k of type F and k of type R). 
Thus, at least ^ links are needed just to draw all a links. □ 

Remark 3.5: Note that an a link can be also ah or c link for many other triangles, and so the 
above argument cannot be easily improved. When k = 2 and n = 3 the complete graph forms 8 
triangles, but needs 6 links, instead of the 4 as given by the lower bound. For complete, directed, 
k-out graphs the asymptotic bound for k — > oo is 

To prove the bound of Proposition 3. 1 we give an algorithm which constructs all the graphs 
with an triangles, and perhaps a few more with more triangles, and we bound the number of 
ways in which this can be done. 

To enumerate all the cases, we first label the nodes in an arbitrary fashion from 1 to n. 
Once this is done, we consider any configuration with an triangles. We associate each triangle 
with a node as follows: Triangles of type F are associated with the node from which the a and 
c links originate. For triangles of type R we label the edges in such a way that the corner where 
the a and the c edges meet has the lowest label among the 3 corners. We call the point from 
which the a link leaves the anchor of the triangle. 

Once this is done, there will be t i triangles anchored at node i, for i = 1, . . . , n. Further- 
more, we denote by v i the number of links arriving at node % once the graph will have been 
completely constructed. Both t i and v i indicate the values at the end of constructing the graph. 

In order to construct all possible graphs with an triangles, we start by choosing the t i and 
the v i . Clearly, 



E 



t i = an , (3.5) 



l 



and therefore the number of ways of choosing the t i is bounded by 



A= ( ^ + (n -l)\ ^ 2(a+1)n _ (3 6) 



71—1 

The v i satisfy £V v i = kn, since each link arrives somewhere. The number of ways to distribute 
the ends of the kn links is therefore bounded above by 



kn+(n-l)\ < 2 ( fe+1)n 



B = [ n x _ l ' ) < 2^ +V}n . (3.7) 



We will need the following 
Lemma 3.6. The product of the v, + k satisfies 



U(v i + k) < (2k) n . (3.8) 



Proof. Since the sum of the v i + k equals 2kn, the maximal value of the product is (2kn/n) n . 

□ 
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Thus, once the t i and v i are fixed, we have lost a (negligible) combinatorial factor Cq, 
withC < 2 a+k+2 (k + 1). 

Lemma 3.7. At each node, at most k 2 round triangles and at most k(k — 1) frustrated triangles 
can be anchored. 

Proof. Consider first the round triangles. There are k outgoing links from a given node, and 
from each of their ends there are another k outgoing links and then the triangle must be closed, 
and so there are at most k round triangles. For the frustrated triangles, we first choose a pair of 
outgoing links and then the direction of the link connecting their ends. □ 

Having fixed the t^v^ we now place the triangles starting with all those anchored at node 
1, proceeding to node 2, 3, and so on, until we arrive at node n. At each node % we construct 
first all the F triangles and then all the R triangles. Assume the first s — 1 triangles have been 
drawn, and assume we are placing the next triangle anchored at node i. We will first make a 
choice of which links of the new triangle are assumed to be present. This gives 8 = 2 3 choices. 
There are 2 more choices between type F and R, (in fact less since we insist on building first 
all the F before the R). For each of these choices, we bound the maximal number of ways a 
triangle can be placed. We call these bounds F- for the frustrated triangles and R- for the round 
triangles, j = 1, . . . , 8. An upper bound on the number of ways to place a triangle (given its 
anchor) is then 

16max(F 1 ,...,F 8 , J R 1 ,..., J R 8 ) . 

When we construct a triangle at i, it will be denoted by its corners m) . If it is round 
its links are a = ij, b = jm, and c = pq with p = m, q = i. If it is frustrated, its links are 
a = ij, b = jm, and c = pq with p = i, q = m. 
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The 16 cases are represented in Table I. The second column indicates which links are new 
in forming the triangle, and the next the number of these new links. The next two columns 
indicate the maximum number of ways the given case can appear. The last column will be 
explained below. 

Proof of Table I. To prove the bounds on the multiplicative factors is just a verification. We 
indicate a few cases to guide the reader. In case 1, we place 3 links of which 2 can be chosen 
freely (the a link and then the b link), whereas the third link is then completely determined. 
Therefore, we get a factor (n — 1) (n — 2), for both types of triangles, and we bound this by n 2 
In case 2, the c link is already present. For a round triangle there are at most v i possibilities for 
a c link to end in i. The a link can be chosen in n ways, and the b link must connect the end 
of a to one of the c, and this can be done in v i ways. Thus we get a factor nv v In the case of 
the frustrated triangles, there are only k — 1 possibilities for the c link (which now originates 
at i) since one link is used as the a link: The factor is therefore at most nk. All other cases 
are discussed similarly, for example, in case 7, the c link is missing, but the a and b links are 
present, and there are k possible ends for a and another k possible ends for each of the b attached 
to a. Finally, we explain case 4 which is the critical case. In it, the b link and the c link are 
given. Since one link cannot be placed in more than n — 1 ways, the factor n is an upper bound. 
Finally, the last column of Table I is calculated as follows. Its entry is an upper bound on the 
sup of the R and F column, divided by n # links . □ 



To explain the proof of Proposition 3.1 we will first consider the simpler situation where 
the cases 4 and 8 do not appear. Indeed, in these two cases, the combinatorial factor of the last 
column in Table I is not small when n is large. In this simplified case, each time we place a 
triangle, the number of links increases by 5q and the number of possibilities is bounded by 

l6n Sq n-\v t + k) 2 . 



Since 1 < 5q < 3 we can bound this from above by 

16n s ^- l / 3 \v l + k) 2 . 

From Lemma 3. 4, we know limits on i ts:iang , so that ^2(8q) = £ tliang , and£ triang e [an /(2k) ,3 an]. 
We get, in the end, for constructing at least an triangles with ^ triang links, an upper bound of 



16 «n n 2W 3 Y[(V Z + kf kl 



The last factor is obtained by observing that there are at most k 2 + k(k — 1) triangles anchored 
at a given node by Lemma 3.7. Placing the links not involved in making triangles gives at most 
a factor n per link, and therefore, using also Eqs.(3.6) and (3.7), we get an upper bound 

n nk n ~ i '^ng/^2^ a+l ^ n l6 an 2^ k+ ^ n (2k) n4k2 . 
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The sum over the possible values of £ triang is bounded by 3an times the largest contribution 
(which occurs for £ triang = an/ (2k)), and we get a bound 

3cm • c n n nk n- an,[6k) 2 {a+l)n , (3.9) 

with 

C = 16 a 2 (fc+1) (2A;) 4fc2 . 

What about the starred cases? As is visible from Table I, there is no gain in the cases 4 
and 8. Since we count everything in terms of links, the case 8 is harmless: We have no gain, but 
we also place no link. Thus the bad case is 4. We will show that case 4 cannot occur too often, 
and thus a fixed minimal proportion of the cases will give a gain. 

In the case 4 (for a frustrated triangle) we are in the process of drawing a triangle in which 
only an a link is missing. In this case, we observe the "history" of the c link. Note that the c 
link originates at i, but its other end has an index m which can be greater or less than i. We 
distinguish several cases: 
Fl) m > i: Then there are 2 subcases. 

Fla) The link i — > m was placed when a triangle anchored at some node % < i was 

formed. Then it must have been placed as a b link. 
Fib) The link i — > m was placed when another triangle anchored at i was formed. Since 
we begin with the frustrated triangles, this must have been a frustrated triangle. Note, 
however, that when the first frustrated triangle at i is being placed, this case cannot 
occur and we must begin with the case Fla (or with a case other than case 4). 
F2) m < i: There are 4 subcases: 

F2a) The link i — > m was a b link when it was placed. 
F2b) The link was placed as a c link of an R triangle anchored at m. 
F2c) The link was placed as a c link of another F triangle anchored at i. 
F2d) The link was placed as an a link of another frustrated triangle anchored at i. Note, 
however, that when the first frustrated triangle at i is being placed, this case cannot 
occur and we must begin with one of the cases F2a-F2c (or with a case other than 
case 4). 

In the case 4 (for a round triangle) we complete a round triangle with a missing a link. In 
this case, we observe the "history" of the b link. There is only one possibility: 
R) The b link of such a triangle will connect nodes j — > m with both j and m greater than i. 
Therefore, it can only have been placed as a b link when we constructed a triangle anchored 
at a node with label % <i. 

To keep track of the conditions mentioned above, we introduce counters which "distribute" 
the gain which comes from placing a b or c link onto those further uses of this link in case 4, 
where no gain is possible. To do the bookkeeping, we introduce for every link ij a counter c^ . 
Each of these is as we start the inductive procedure to be described below: 

c ij,0 = > 

for all % 7^ j E {1, . . . , n}. If a link ij is placed for the first time and it is a b link or a c link as 
it is placed, we set c- = 2k — 2. Each time a link ij is used as a c link in one of the cases Fla 
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or F2a-F2c {and only in those) the counter c i ■ is reduced by one. The maximal number of uses 
in Case 4 of a link placed originally as a b link is 2k — 2 (used k — 1 times for the c link of an 
F triangle and another k — 1 times for the 6 link of an R triangle). Similarly, a link placed as a 
c link can be used in Case 4 another k — 2 times as a c link in an F triangle. Since the number 
of uses is less than 2k — 2, none of the counters c i • will ever become negative. 

Our last counters keep track of the occurrence of the number of times we are in case Fib or 
F2d at a given node i. At the beginning of the induction, we set a i = for alH G {1, . . . , n}. 
Each time we encounter a case among 1-3, 5-7, Fla, F2a-F2c, or R, at node i, we increase a i 
by k. Each time, we encounter case Fib or F2d, we decrease a i by 1. Note that since these 
latter cases cannot occur more than k times, and they can not occur for the first triangle at node 
i, we conclude that none of the counters a i ever becomes negative. 

We now prove recursively that at any given step of the construction after adding triangle t, 
we have a bound on the total number of possibilities which is of the form 

t 

N t < W 1 -") n- a ^r-s c ^t/(k-2) n -,J2 s a s /k Y[(l6(v ig + kf) , (3.10) 

8 = 1 

where q t is the number of links already drawn, i s is the number of the node at which the s th 
triangle is anchored and a > 0, (3 > 0, and g > will be given later on. If we can show that 
there is a positive (3 for which these inequalities hold, then we have shown a bound of the type 
of Proposition 3.1, since none of the counters ever becomes negative. 

The recursive proof starts when there is no link and all counters are equal to 0, hence the 
bound is trivially true, {N = 1). 

We now explain the action at node i. During the construction of the triangles anchored 
at i, some counters will be updated, and the bound on the combinatorial factor will evolve 
correspondingly. We now inspect the evolution of the bound during the different possible 
actions taken at step i. Assume that t — 1 triangles have been placed, and that we are placing 
now triangle t which is anchored at i t = i. 
Case 1: According to column R or F of Table I, we have 

But according to the second and third columns of Table I, the number of links increases 
by 3, thus q t = q t _ x + 3. The link jm was empty at time t — 1 and will be filled 
at time t. Therefore, c m t _ 1 = and c, m t = 2k — 2, and similarly c [m ^ t _ x = 
and c [m ^ t = 2k — 2. Finally, a it = a i t _ x + k. The other counters are unchanged. 
Therefore, we find 

n 2 - n ~ ac i m - t - l/ ( 2k ~ 2 ' ) n ~ ac ^^ t - [/{2k ~ 2) n~ ga ^ t - [/k 

< n 3(1_/3) n ~ ac ^- t/{2k ~ 2) n~ ac ^^- t/{2k ~ 2) n~ ga ^ t/k ■ n 2 -^-P)+ 2 ^+s 

~ ' (3.H) 
where [m, i] = mi or im according to the orientation of the link (R or F case). 1 Since 

q t = <7 t _j + 3, we see that N t satisfies the inductive bound provided the last factor in 

1 Note that 

- <TC jm,t-l/( 2k ~ 2 ) = - ac jm,t/( 2k ~ 2 ) + a » 
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(3.11) is < 1, which is the case if 

1 > 3/3 + 2a + £. (3.12) 

Case 2: Two new links (an a and a b) appear and the combinatorial factor is max(nw i , nk) < 
n(v i + k). The counter c - m is increased by 2k — 2 and the counter a i is increased by 
k. Also, g t = q t _ Y + 2. The bound analogous to (3.11) is therefore 

n(Vi + k) ■ n -- Jm , t -i/( 2fc - 2 ) n -^, t -i/ fc 

which proves the inductive assumption if 

1 > 2f3 + a + g. (3.13) 

Case 3: The counter c [m ^ is increased by 2k — 2 and a i is increased by k, and the inductive 
bound is 

nk n-° c l™,il,t-i/(2k-2) n -oa itt _jk 

which proves the inductive assumption if (3.13) holds. 
Case 5: The counters c m and c [m i] are increased by 2k — 2, and a i is increased by k. Therefore, 

nk- n -<Jc j"M-i/( 2A: - 2 ) n - <TC [« l ,i],t-i/( 2fc - 2 ) n -e a i,t-J k 
< n 2(1_/3) n" <TC i-. t/(2fc " 2) n _<TC ["^]. t/(2fc ~ 2) n" eai - t/fc n 1 ~ 2{1 ~ f3)+2a+e (v i + fc) , 

which proves the inductive assumption if 

1 > 2/3 + 2a + £. (3.14) 

Case 6: The counter c m is increased by 2/c — 2, and a i is increased by k. Therefore we get 

n°(Vi + k) 2 - n - ffC wm],t-i/(2*-2) n -^a iit _ 1 /fc 

< n 1 - /3 n- <Jc ^.'/( 2fc - 2 )n-^^ fc n-( 1 - /3)+CT+ ^(^ + fc) 2 , 



and also 

- <7C [m,i],t-l/( 2fc " 2 ) = - CTC [m,i],t/( 2fc " 2 ) + ° . 
and this exactly compensates the factor n 2tT . Similarly, 

-ga iit _ 1 /k = -ga itt /k + Q, 
which compensates the factor n e at the end of (3.11). 



Graphs with a Density of Triangles 



13 



which proves the inductive assumption if 

1 > (3 + a + g. (3.15) 
Case 7: The counter c [m ^ is increased by 2k — 2, and a i is increased by k. Therefore we get 

n°k 2 - n ~' TClm ' i] ' t - l ^ 2k ~ 2 ' ) n ~ ea ' i ' t ^^ k 

< n l ~P n -c lm , iht /(2k-2) n -ea iit /k n -(l-f3)+a+^ Vi + fc) 2 ^ 

which proves the inductive assumption if (3.15) holds. 
Case 8: This case occurs if we want to draw a triangle anchored at % which has appeared in an 
earlier phase of the construction (for example, if its sides are all sides of type b from 
triangles anchored at i < i). In this case, no new link, but 1 new triangle and a factor 
appear 

k 2 < (v t + k) 2 , 

and the inductive assumption evidently holds, since only the number of triangles 
increases, but not the number of links. 
The conditions we require so far on (3, a, and g are all satisfied if 

1 > 3/3 + 2a + £. (3.16) 

We now come to 

Case 4: Whenever one of the subcases Fla, Fib, or F2a-F2c applies, the link % — > m was placed 
earlier as a b or a c link, and we decrease the corresponding counter c im by one unit, and in the 
case R the counter c mi is decreased. The counter a { is unchanged in these cases. Therefore, we 
find that 

n . n -««W],t-i/(*-2) < n l-/3 n -ac [mtiht /(k-2) n f3-a/(2k-2) ^ 

and this proves the inductive assumption provided 

< a /{2k -2) . (3.17) 

The remaining cases are Fib and F2d. In these cases the counter a i becomes useful: It is 
decreased by 1 and all other counters are unchanged. Therefore, we get 

n . n -c°i,t-i/fc < n l -' 3 n- ea ^ /k n p ~ Q l k , 

and the inductive assumption holds provided 

(3 < g/k. (3.18) 

We have now discussed all cases. It remains to see that the constants (3, a, and g can be 
chosen consistently. They have to satisfy (3.16), (3.17), and (3.18). We find that the optimal 
solution is 



g = kp, a = (2k - 2)13 , and 1=3/3 + 2(2A; - 2)/3 + k(3 
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Thus, we find 

* = 2k(5k+ 1) ' 

and the proof of the inductive assumption is complete. 

Recall that to draw t triangles we need at least t/ (2k) links, and no more than 3t links. We 
therefore conclude (combining the bound on N t with (3.6) and (3.7)) that the number of graphs 
with t triangles is bounded by 

3t ■ 1 (j t 2 t+n n nk ~P t ^ 2k ^2( kJr ^ n (2k) n ( 2k2 ~ k ^ 

where the last factor follows from (3.10) and the observation that the number of triangles 
anchored at a node is bounded by 2k 2 — k by Lemma 3.7. The proof of Proposition 3.1 is 
complete. □ 



4. A lower bound for /c-general graphs 

Here, we consider the class Q n k of graphs with n nodes and kn links which can be placed 
anywhere we please. (This model is close to the well known model G n where any of the links 
is chosen with probability p = 2k /n.) 

Lemma 4.1. Fix any a > 0. The number of graphs with an triangles in the class of graphs 
with n nodes and kn links is (for large enough n) at least 



„, 2/3, . .2/3 

\Gn.kJ > e~ 0(n } n-^ \Q. 



n,k I 



Remark 4.2: This is clearly much larger than the bound of Proposition 3.1, and almost as large 



as n nk , in fact 



lim 1 ° Slg "' t ' J = * 
n^oo nlogn 



This also proves (1.1), since 

(2) ) „ n nk 

nk 



'n.k I 



Proof. Among the n nodes choose This can be done in 



71 

1 £ ' (4 ' 1) 
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ways. (Here and throughout the proof we do not worry about the integer parts.) With these 
nodes we build a complete graph, this consumes about a 2 ^ 3 n 2 ^ 3 links and gives an triangles 
(and it can be done in 1 way). Now among the remaining n — a x ^ 3 n x l 3 nodes distribute the 
kn — a 2 ' 3 ?? 2 / 3 links so that there is no triangle. The number of ways this can be done can be 
estimated from below using the well known result on the number of graphs without triangles 
(see [B]), namely this number is bounded below by 

°(<i"5'i)^" 0< " !,V,<, " >VI (®)- <"> 

Combining (4.1) and (4.2) we get the lower bound. □ 



Remark 4.3: In some sense, this result can be viewed as a complement to the large deviation 
results of Vu[VV], see also [JLR]. Consider the polynomial associated with triangles in a graph 
with n nodes: _ 

l<i<j'<m<n 

For this polynomial (in the case of the model Q' n p where links appear with probability p = k/n) 
one has 

E(Y) = 0(1) , E'(Y) = O^" 1 ) , 

with E the expectation in the random set. Taking Theorem 1.1 in [VV] and choosing A = 
0(a 1 ^ 3 n 1 ^ 2 ) one gets an upper bound of the form 

P{\Y\ > an) < e-° (a n } . 
Note that this is consistent with the lower bound of Lemma 4. 1 . 



5. A lower bound for /c-out graphs (and for /c-regular graphs) 

Consider the graphs Q n fc _ out which are of type /c-out. Recall that 

l^n,A;-outl ~ U 

We will consider graphs /C fc+1 which are complete in the sense that there are k + 1 nodes, and 
from each node k out-links are leaving (to another node of JC k+l ). Counting in this case gives 
k + 1 nodes, k(k + 1) links (one for each direction) and 8 ( fc ^ _1 ) triangles (the factor 8 accounting 
for the 8 ways to use the 6 links on each triangle, one back and one forth for each pair of nodes). 
We now distribute the an triangles into 2 



To simplify the discussion, which is in terms of orders of magnitude, we assume that all quotients are integers. 
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disjoint complete graphs K, k+1 , and this leaves 



R n = „-(fc+l)-_ 



nodes which will not have been used when making the complete graphs of type /C fc+1 . All links 
originating from the nodes of the clusters are used up in forming the /C fc+1 . The number of ways 
to place the C n complete graphs is 



a?) 



(fc+l)\. 



») — ~ „«- , (5.i) 



with 

cm(/c + 1) an 



The first factor in (5. 1) counts the number of ways to choose the nodes involved, and the quotient 
counts the number of ways the (k + l)C n nodes are grouped into clusters of k + 1 each. Since 
the graphs use k links per node, the graph we can construct with the remaining R n nodes will 
be disjoint from the C n clusters, and we want to bound the number of ways in connecting the 
remaining nodes without adding any triangles. A lower bound on the number of such graphs is 
obtained by constructing again a /c-out bi-partite graph on the remaining R n nodes. 
The number of ways to place the remaining links is therefore bounded below by 

RJA R - s 

™ 1 m n n , 
with 

an(k + V 



S„ = k\n 



8(T) 



Note that we do not insist on making a connected graph. So we find a lower bound of E n n Tn 



where E depends only on k and 



3a 1 . \ , 3a 

= n k- — (1 + rr-r) = nk 1- 



i 5|__ (fc(fc+1) _ (fc + 1) + 1) ) 



Ak y k 2 -VJ \ 4(k 2 - 1) 



Remark 5.1: The above calculation proves the lower bound for (1.2). The lower bound for 
(1.3) is an easy variant, observing the fact that instead of 8 triangles in a complete graph on 3 
nodes in the k -out model there is only 1 in the k -regular model. 
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